AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Efficient Vision-Language Understanding

# Efficient Vision-Language Understanding

Tinyllava Phi 2 SigLIP 3.1B
Apache-2.0
TinyLLaVA-Phi-2-SigLIP-3.1B is a small-scale large multimodal model with 3.1B parameters, combining the Phi-2 language model and SigLIP vision model, outperforming some 7B models.
Image-to-Text Transformers
T
tinyllava
4,295
16
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase